Abstract — In this paper, we consider distributive subband,
power and rate allocation for a two-hop transmission in an
orthogonal frequency-division multiple-access (OFDMA) cellular
system with fixed relays operating in a decode-and-forward
strategy. We account for system fairness by considering
weighted sum goodput as our optimization objective. Based
on the cluster-based architecture, we obtain a fast-converging
distributive solution with only local imperfect CSIT by
decomposing the optimization problem. To further reduce
the signaling overhead and computational complexity, we propose
a reduced feedback distributive solution, which can achieve
asymptotically optimal performance for a large number of users
with arbitrarily small feedback overhead per user. We also derive
the asymptotic average system throughput for the relay-assisted
OFDMA system so as to obtain useful design insights.

Index Terms — relay, OFDMA, resource allocation, fairness, 
distributive algorithm 

I. Introduction 

The relay-assisted OFDMA cellular system is a promising
architecture for future wireless communication systems
because it offers huge potential for enhanced system capacity,
coverage as well as reliability [1], [2]. Full-duplex relay
stations were discussed in the early literature. However, a
full-duplex transceiver is hard to implement in practical systems
because the relay has to transmit and receive at the same
time¹. Since practical relays operate in a half-duplex manner,
there is a duplexing penalty associated with using the relay
to forward packets. As a result, it is not always advantageous
to deliver packets via relay stations and it is very important to
adapt the relay resource dynamically according to the global
channel state information (GCSI)² in order to capture the
advantage of the relay-assisted OFDMA architecture.
However, perfect knowledge of GCSI is very difficult to obtain
in both TDD (implicit CSI feedback) and FDD (explicit CSI
feedback) systems due to the huge signaling overhead involved
in delivering GCSI to the controller.
¹In practice, there is cross-coupling between the transmit path and the
receive path in any transceiver circuit. Hence, when the relay transmits (with
high power), there will be some leakage into the receiver path, which will
cause strong interference to the received signal (which is much weaker than
the transmit signal).

²Global CSI refers to the CSI between the base station (BS) and relay (RS), the
CSI between the RS and mobile (MS) as well as the CSI between the BS and the MS.



Considerable research has focused on improving
the system throughput through optimal resource allocation
in the relay-assisted OFDMA system. For example, in [3],
[4], optimal subband allocation is considered for different
scheduling schemes based on equal power allocation across
all the subbands. In [5], heuristic separate subband and power
allocation schemes are considered to improve the system
capacity. To further improve the system throughput, joint
subband and power allocation is proposed in [6], [7]. Due to
the subband allocation constraint in the OFDMA system, the
joint optimization problem is an NP-hard integer programming
problem, and continuous relaxation as well as graph theoretical
approaches are used to tackle the problem. However, as a
simplification in both [6], [7], it is assumed that the
source-relay, source-destination and relay-destination links use the
same subband. As a result, these schemes cannot effectively
exploit multiuser diversity, which is an important contributor
to system performance. While these works provide important initial
investigations on the potential benefit of relay-assisted OFDMA
systems, they require centralized resource allocation and perfect
knowledge of GCSI. In general, there are still
several important technical issues to be resolved in
order to bridge the gap between theoretical gains and practical
implementation considerations. They are elaborated below.

• Distributive Implementation: There are two potential
issues associated with centralized implementations, namely
the complexity issue and the signaling load issue.
For instance, the centralized joint optimization imposes a
huge computational load on the BS. Similarly, large
signaling overhead is needed to collect the GCSI (BS-RS,
BS-MS, RS-MS) from the RSs and MSs as well
as to broadcast the scheduled results to the RSs and
MSs. In [8], the authors proposed two semi-distributed
sub-optimal subband allocation methods based on equal
power allocation, which offload part of the
computational load from the BS, but substantial signaling
overhead is still needed to collect the GCSI from the
RSs.

• Imperfect Knowledge of GCSI: While all the above
works assume perfect GCSI knowledge at the transmitter,
the CSIT measured at the transmitter side cannot be
perfect due to either the CSIT estimation noise or the
outdatedness of CSIT resulting from duplexing delay.
When the CSIT is imperfect, there will be systematic
packet errors (despite the use of strong channel coding)
whenever the scheduled data rate exceeds the instantaneous
channel mutual information. As a result, rate adaptation
(in addition to power and subband allocations) must be
considered in systems with imperfect CSIT in order to
account for potential packet errors due to channel
outage [9], [10], [11], [12].

• Fairness Consideration: Most existing works only
focus on sum-throughput maximization. However,
fairness among users in the system is also an important
consideration. In particular, it is important to study the
potential fairness advantage of relay-assisted OFDMA
systems as well.

• Analytical Performance Results for Design Insights: In
all the existing works above, the system performance is
obtained by simulations only. However, it is very difficult
to obtain useful design insights (e.g. how the system
performance scales with system parameters such as $M$,
$K$ as well as path loss exponents of BS-RS and RS-MS
links) without analytical performance results.

In this paper, we shall attempt to shed some light on the
above issues. We consider a two-hop relay-assisted OFDMA
system in a single cell with one base station (BS), $M$ relay
stations (RS) and $K$ mobile stations (MS). In addition, we
account for the penalty of packet errors due to imperfect CSIT
by considering system goodput (b/s/Hz successfully delivered
to the MS) as our performance measure instead of traditional
sum-ergodic capacity (which only measures b/s/Hz sent by
the BS or RS). We account for system fairness
by considering weighted sum goodput as our optimization
objective (which includes proportional fair scheduling (PFS)
as a special case). Based on the cluster-based architecture, we
obtain a fast-converging distributive solution with only local
CSIT using careful decomposition of the optimization problem.
For the downlink implementation, to further reduce the
signaling overhead and computational complexity, we propose
a reduced feedback distributive solution and show that only
an arbitrarily small feedback overhead per MS is needed
to achieve asymptotically optimal performance for large $K$.
Finally, we also derive the asymptotic average system throughput
for the relay-assisted OFDMA system so as to obtain useful
design insights.

II. System Model 

In this section, we shall describe the cluster-based system 
architecture, the imperfect GCSI model and the system utility. 

A. Cluster-based Architecture and Channel Model 

Fig. 1 illustrates the system model of the relay-assisted
OFDMA system with one BS, $M$ fixed RSs and $K$ MSs. The
coverage area is divided into $M + 1$ clusters with cluster 0
served by the BS and cluster $m$ ($m \in \{1, \dots, M\}$) served by the
$m$th RS. The $K$ MSs are assigned to one of the $M + 1$ clusters
according to their large scale path loss³. The number of MSs
in cluster $m$ is $K_m$. MSs in cluster 0 will receive downlink

³The $k$th user is assigned to the cluster with the strongest received pilot
strength.



Fig. 1. System Model (including TDD frame timing: the uplink CSI estimate is used as downlink CSIT)



packets or transmit uplink packets directly from or to the BS,
and MSs in cluster $m$ ($m \in \{1, \dots, M\}$) will rely on the $m$th
RS to forward the downlink or uplink packets in the packet
transmission process. Let $\mathcal{K}_m$ ($m \in \{1, \dots, M\}$) denote the set
of MSs in cluster $m$, and $\mathcal{K}_0$ denote the set of MSs in cluster 0
and the RSs. For notational convenience, we assume index $k$
in cluster 0 denotes the $k$th MS if $k \in \{1, \dots, K_0\}$ and the $m$th
RS if $k = K_0 + m$ ($m \in \{1, \dots, M\}$).

We consider frequency selective fading where there are 
N independent multipaths. OFDMA is employed to convert 
the frequency selective fading channel into T orthogonal 
subcarriers with N independent subbands. The RSs operate in 
a half-duplex manner using decode-and-forward (DF) strategy. 
In order to facilitate relay-assisted packet transmission, a 
physical frame in the downlink is divided into two phases: 

• In phase one, the BS transmits data to the MSs and the 
RSs belonging to cluster 0. 

• In phase two, the BS stops transmitting while the RSs
(which have successfully decoded data from the BS in 
phase one) will forward the data to the MSs belonging 
to their own clusters. 

Similarly, the two phases of a physical frame in the uplink are 

• In phase one, the MSs belonging to the relay clusters will 
transmit data to their own RSs. 

• In phase two, the MSs and the RSs (which have
successfully decoded data from their MSs in phase one)
belonging to cluster 0 will transmit the data to the BS.

Two orthogonal frequency spreading codes are assigned to
adjacent RS clusters to mitigate potential mutual interference
between them as illustrated in Fig. 1⁴. Since the path loss
between non-adjacent RS clusters is usually quite large,
there is practically negligible mutual interference between
non-adjacent RS clusters. As a result, all the RS clusters
can operate simultaneously on the entire frequency band with
⁴To achieve the orthogonal signal separation in the frequency domain,
signals from/to different RSs must be synchronized at the OFDM symbol
boundary. This requires timing synchronization within the unused cyclic prefix
in the OFDMA system and is implementable in practice. As a result, it is a
common assumption in OFDMA systems such as WiMAX.
negligible interference⁵.

In two phases ($m = 0$ for phase one of the downlink
(phase two of the uplink) and $m \in \{1, \dots, M\}$ for phase
two of the downlink (phase one of the uplink)), the received
symbol $Y_{m,k,n}$ carrying user $k$'s information in cluster $m$ in
the $n$th subband is given by

$$Y_{m,k,n} = \sqrt{p_{m,k,n}\, l_{m,k}}\; H_{m,k,n} X_{m,k,n} + Z_{m,k,n}, \quad m \in \{0, \dots, M\}$$

where $X_{m,k,n}$ is the transmitted symbol from (to) the BS ($m = 0$)
or the $m$th RS ($m \in \{1, \dots, M\}$) to (from) user $k$ ($k \in \mathcal{K}_m$) in
cluster $m$ in the $n$th subband, $p_{m,k,n}$ is the transmit SNR,
$l_{m,k}$ is the path loss, and $Z_{m,k,n} \sim \mathcal{CN}(0, 1)$ is the noise
in the $n$th subband. We consider Rayleigh fading and hence,
$H_{m,k,n} \sim \mathcal{CN}(0, 1)$.



B. Global Channel State Information Model 

In this paper, we consider a relay-assisted cellular system 
with large coverage, and hence, both small scale fading (due 
to multipath) and large scale fading (due to path loss) are 
considered. We consider the block fading channel where the 
small scale fading coefficient is quasi-static within each frame 
and may be different among frames. We consider the resource 
allocation for the low-mobility users. Since the time scale for 
mobility is much larger than that for small scale fading, the 
path loss remains constant for a large number of frames and 
can be estimated with high accuracy⁶. The CSIT of small scale
fading can be obtained from either explicit feedback (FDD
systems) or implicit feedback (TDD systems) using reciprocity
between uplink and downlink⁷. We consider TDD systems, and
assume perfect CSIR and imperfect CSIT due to estimation
noise on the uplink pilots or duplexing delay as illustrated in
Fig. 1. The estimated CSIT in phase one and phase two can
be modeled as

$$\hat{H}_{m,k,n} = H_{m,k,n} + \Delta H_{m,k,n}, \quad m \in \{0, \dots, M\} \tag{1}$$

where $H_{m,k,n}$ is the actual CSI, and $\Delta H_{m,k,n} \sim \mathcal{CN}(0, \sigma_e^2)$
is the CSIT error. We denote the set of local imperfect CSIT
of cluster $m$ as $\hat{\mathbf{H}}_m = \{\hat{H}_{m,k,n} \mid k \in \mathcal{K}_m, \forall n\}$, and the global
imperfect CSIT as $\hat{\mathbf{H}} = \bigcup_{m=0}^{M} \hat{\mathbf{H}}_m$.
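
As an illustration of the CSIT error model in (1), the following Python sketch draws Rayleigh-fading coefficients and noisy estimates of them, then checks the empirical error variance. The helper names (`complex_gaussian`, `estimate_csit`) and the value `sigma_e2 = 0.1` are assumptions made for this example only and do not come from the paper.

```python
import math
import random


def complex_gaussian(rng, variance):
    """Draw one sample from CN(0, variance): independent real and
    imaginary parts, each with variance / 2."""
    std = math.sqrt(variance / 2.0)
    return complex(rng.gauss(0.0, std), rng.gauss(0.0, std))


def estimate_csit(h_true, sigma_e2, rng):
    """Return H_hat = H + dH with dH ~ CN(0, sigma_e2), as in (1)."""
    return h_true + complex_gaussian(rng, sigma_e2)


rng = random.Random(1)
sigma_e2 = 0.1  # hypothetical CSIT error variance

squared_errors = []
for _ in range(20000):
    h = complex_gaussian(rng, 1.0)          # Rayleigh fading, H ~ CN(0, 1)
    h_hat = estimate_csit(h, sigma_e2, rng)
    squared_errors.append(abs(h_hat - h) ** 2)

# Empirical E|dH|^2, which should be close to sigma_e2.
mean_sq_error = sum(squared_errors) / len(squared_errors)
print(mean_sq_error)
```

With `sigma_e2 = 0` the estimate coincides with the true coefficient, which corresponds to the perfect-CSIT case assumed by the earlier works cited in the introduction.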



C. System Policy 

In this paper, we consider joint subband, power and rate
allocation⁸. The system policies are defined below.

⁵For tractable analysis, we assume all RSs are separated either spatially
or in the code domain. Yet, the effect of mutual interference is taken into
consideration in the performance simulation.

⁶For example, in the 802.16e system, the frame duration is 5 ms. Users with
pedestrian mobility of 5 km/hr will have a coherence time of over 20 frames.

⁷In practical systems, like IEEE 802.16e, a mechanism named "channel
sounding" is proposed to enable the BS to determine the BS-to-MS channel
response under the assumption of TDD reciprocity.

⁸For both the uplink and downlink transmission, the resource allocation is
performed at the BS and RSs.




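1) Subband Allocation Policy $\mathcal{S}$: Let $s^B_{m,k,n} \in \{0, 1\}$ be
the subband allocation indicator at the BS for MS $k$ in cluster
$m$ ($k \in \{1, \dots, K_0\}$ when $m = 0$, $k \in \mathcal{K}_m$ when $m \in \{1, \dots, M\}$).
Let $s^R_{m,k,n} \in \{0, 1\}$ be the subband allocation indicator at the
$m$th ($m \in \{1, \dots, M\}$) RS for user $k$ ($k \in \mathcal{K}_m$) in cluster $m$.
The subband allocation policy is

$$\mathcal{S} = \Big\{ s^B_{m,k,n},\, s^R_{m,k,n} \in \{0, 1\} \;\Big|\; \forall n,\ \sum_{m=1}^{M} \sum_{k \in \mathcal{K}_m} s^B_{m,k,n} + \sum_{k=1}^{K_0} s^B_{0,k,n} = 1,\ \sum_{k \in \mathcal{K}_m} s^R_{m,k,n} = 1 \Big\}$$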

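2) Power Allocation Policy $\mathcal{P}$: Let $p_{m,k,n}$ be the scheduled
transmit SNR at the BS ($m = 0$) and the $m$th RS ($m \in \{1, \dots, M\}$)
respectively to user $k$ ($k \in \mathcal{K}_m$) in cluster $m$ in the $n$th
subband. Let $P_m$ be the total transmit SNR at the BS ($m = 0$)
and the $m$th RS ($m \in \{1, \dots, M\}$). Let $P_{m,k}$ be the total transmit
SNR at user $k$ ($k \in \mathcal{K}_m$). The power allocation policies in
downlink and uplink systems ($\forall m \in \{0, \dots, M\}$) are

$$\mathcal{P}^{DL}_m = \Big\{ p_{m,k,n} \ge 0 \;\Big|\; \sum_{n=1}^{N} \sum_{k \in \mathcal{K}_m} p_{m,k,n} \le P_m \Big\}$$

$$\mathcal{P}^{UL}_m = \Big\{ p_{m,k,n} \ge 0 \;\Big|\; \sum_{n=1}^{N} p_{m,k,n} \le P_{m,k},\ \forall k \in \mathcal{K}_m \Big\}$$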

3) Rate Allocation Policy $\mathcal{R}$: Let $r_{m,k,n}$ be the scheduled
data rate at the BS ($m = 0$) and the $m$th RS ($m \in \{1, \dots, M\}$)
respectively to user $k$ ($k \in \mathcal{K}_m$) in cluster $m$ in the $n$th
subband. The rate allocation policy is

$$\mathcal{R} = \{ r_{m,k,n} \ge 0 \mid \forall n,\ m \in \{0, \dots, M\},\ k \in \mathcal{K}_m \}$$
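
The difference between the downlink and uplink power constraints can be made concrete with a small check. The sketch below assumes a hypothetical layout in which `p[k][n]` holds the transmit SNR of user k on subband n within one cluster; the function names are illustrative and not taken from the paper. In the downlink the cluster controller has one shared budget, whereas in the uplink each MS has its own budget.

```python
def dl_power_ok(p, total_budget):
    """Downlink: all users of one cluster share the controller's budget.

    p[k][n] is the (non-negative) transmit SNR to user k on subband n.
    """
    if any(value < 0 for row in p for value in row):
        return False
    return sum(sum(row) for row in p) <= total_budget


def ul_power_ok(p, per_user_budget):
    """Uplink: each user k has an individual budget per_user_budget[k]."""
    if any(value < 0 for row in p for value in row):
        return False
    return all(sum(row) <= cap for row, cap in zip(p, per_user_budget))


# Two users, two subbands (hypothetical numbers).
p = [[1.0, 2.0],
     [0.5, 0.5]]
print(dl_power_ok(p, total_budget=4.0))
print(ul_power_ok(p, per_user_budget=[3.0, 1.0]))
```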

D. Maximum Achievable Data Rate and System Goodput

In this part, we shall introduce the system utility based on
the notations and system policies defined in the previous part⁹.

Given perfect CSIR, the instantaneous mutual information
(bit/s/Hz) between the $m$th transmitter (receiver) and the $k$th
receiver (transmitter) ($k \in \mathcal{K}_m$) in the $n$th subband is given
by

$$C_{m,k,n} = c_m \log_2\big(1 + p_{m,k,n}\, l_{m,k}\, |H_{m,k,n}|^2\big) \tag{2}$$

where $c_0 = 0.5$ and $c_m = 0.25$ ($m \in \{1, \dots, M\}$)¹⁰.
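
Equation (2) and the resulting outage event (a scheduled rate above the instantaneous mutual information) can be sketched in a few lines of Python. The function names and the numerical values are hypothetical; only the formula and the constants $c_0 = 0.5$ and $c_m = 0.25$ come from the text.

```python
import math

C_BS = 0.5    # c_0 in (2): BS links
C_RS = 0.25   # c_m in (2), m >= 1: RS links


def mutual_information(p, l, h, c):
    """Instantaneous mutual information (bit/s/Hz) of one link, as in (2).

    p: transmit SNR, l: path loss, h: complex channel coefficient,
    c: pre-log factor (C_BS or C_RS).
    """
    return c * math.log2(1.0 + p * l * abs(h) ** 2)


def packet_in_outage(rate, p, l, h, c):
    """A packet is lost when the scheduled rate exceeds the mutual information."""
    return rate > mutual_information(p, l, h, c)


# Hypothetical link: p * l * |h|^2 = 3, so log2(1 + 3) = 2.
print(mutual_information(3.0, 1.0, 1 + 0j, C_BS))
print(packet_in_outage(1.2, 3.0, 1.0, 1 + 0j, C_BS))
```

Because the transmitter only knows the estimate of the channel and not the channel itself, it cannot evaluate `packet_in_outage` directly, which is why the rate has to be chosen against a target outage probability.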

Due to imperfect CSIT, the transmitter does not have
knowledge of the instantaneous mutual information in (2)
and hence, the scheduled data rate at the BS and the RS
might be larger than the corresponding mutual information,
leading to packet outage. To take the potential outage into

⁹Note that $p_{0,k,n}$, $r_{0,k,n}$ ($k = K_0 + m$) are the power and rate
control variables for the $m$th RS ($m = 1, \dots, M$). $s^B_{0,k,n}$ is the
subband allocation variable for the direct link BS-MS$_k$ with $m = 0$,
$k \in \{1, \dots, K_0\}$. $s^B_{m,k,n}$, $s^R_{m,k,n}$ are the subband allocation variables for
the indirect link BS-RS$_m$-MS$_k$ with $m \in \{1, \dots, M\}$, $k \in \mathcal{K}_m$.

¹⁰$c_0 = \frac{1}{2}$ is due to the duplexing penalty. $c_m = \frac{1}{4}$ is due to the duplexing penalty and the
spreading code with code rate $\frac{1}{2}$ used in RS clusters. For simplicity, we assume
that the same amount of time resources is used for each phase. However, our
design can be directly applied to a system with unequal phase durations by
changing the duplexing penalty ratio between phase one and phase two.
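consideration and to guarantee the scheduling fairness, we
consider the average weighted system goodput (average weighted
b/s/Hz successfully delivered to the MSs) as our
performance measure. The average weighted total system goodput
is given by $U_{wgp}(\mathcal{S}, \mathcal{P}, \mathcal{R}) = \mathbb{E}_{\hat{\mathbf{H}}}[\bar{U}_{wgp}(\mathbf{S}, \mathbf{P}, \mathbf{R}, \hat{\mathbf{H}})]$, where
$\bar{U}_{wgp}(\mathbf{S}, \mathbf{P}, \mathbf{R}, \hat{\mathbf{H}})$ is the conditional average total weighted
goodput for given $\hat{\mathbf{H}}$, and is given by

$$\bar{U}_{wgp}(\mathbf{S}, \mathbf{P}, \mathbf{R}, \hat{\mathbf{H}}) = \frac{1}{N} \Bigg[ \sum_{m=1}^{M} \sum_{k \in \mathcal{K}_m} \alpha_k^m \min\Big\{ \sum_{n=1}^{N} s^B_{m,k,n}\, r_{0,K_0+m,n} \big(1 - \Pr[C_{0,K_0+m,n} < r_{0,K_0+m,n} \mid \hat{\mathbf{H}}]\big),\ \sum_{n=1}^{N} s^R_{m,k,n}\, r_{m,k,n} \big(1 - \Pr[C_{m,k,n} < r_{m,k,n} \mid \hat{\mathbf{H}}]\big) \Big\} + \sum_{k=1}^{K_0} \alpha_k^0 \sum_{n=1}^{N} s^B_{0,k,n}\, r_{0,k,n} \big(1 - \Pr[C_{0,k,n} < r_{0,k,n} \mid \hat{\mathbf{H}}]\big) \Bigg] \tag{3}$$

where $\alpha_k^m$ denotes the weight of MS $k$ in cluster $m$, and
$\Pr[C_{m,k,n} < r_{m,k,n} \mid \hat{\mathbf{H}}]$ ($\forall n$, $m \in \{0, \dots, M\}$, $k \in \mathcal{K}_m$) is the
conditional packet error probability of the one-hop link for given
global imperfect CSIT $\hat{\mathbf{H}}$.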
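
To make the goodput measure more tangible, the sketch below evaluates the contribution of a single relayed user under the assumption that a decode-and-forward user can receive no more than the weaker of its two hops delivers. The function names, the list layout (one entry per subband) and all numbers are hypothetical and serve only as an illustration.

```python
def hop_goodput(shares, rates, outage_prob):
    """Goodput of one hop: scheduled rate on the allocated subbands,
    discounted by the conditional packet error probability."""
    return sum(s * r * (1.0 - outage_prob) for s, r in zip(shares, rates))


def relayed_user_goodput(weight, first_hop, second_hop):
    """Weighted end-to-end goodput of a relayed user, limited by the weaker hop."""
    return weight * min(first_hop, second_hop)


# One relayed user over three subbands with a 10% target outage probability.
first = hop_goodput(shares=[1, 0, 1], rates=[2.0, 3.0, 1.0], outage_prob=0.1)
second = hop_goodput(shares=[0, 1, 0], rates=[1.0, 1.5, 1.0], outage_prob=0.1)
print(relayed_user_goodput(2.0, first, second))
```

In this example the second hop is the bottleneck, so adding resources to the first hop alone would not increase the user's goodput. Balancing the two hops is what the dual variables in Section IV accomplish.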

III. Subband, Power and Rate Scheduling Problem
Formulation

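In this section, we shall formulate the relay-assisted
scheduling problem as an optimization problem maximizing the
average total weighted goodput $U_{wgp}(\mathcal{S}, \mathcal{P}, \mathcal{R})$ subject to the target
outage probability $\epsilon$. Note that optimization of $U_{wgp}(\mathcal{S}, \mathcal{P}, \mathcal{R})$
w.r.t. policies (sets of actions for all CSIT realizations) is
equivalent to optimization of $\bar{U}_{wgp}(\mathbf{S}, \mathbf{P}, \mathbf{R}, \hat{\mathbf{H}})$ w.r.t. the
actions $\mathbf{S}$, $\mathbf{P}$, $\mathbf{R}$ ($\mathbf{S} = \mathcal{S}(\hat{\mathbf{H}})$, $\mathbf{P} = \mathcal{P}(\hat{\mathbf{H}})$, $\mathbf{R} = \mathcal{R}(\hat{\mathbf{H}})$) for
each given CSIT realization. Hence, the optimization problem
is given by
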
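Problem 1: (Subband, Power and Rate Optimization Problem)

$$\max_{\mathbf{S}, \mathbf{P}, \mathbf{R}} \ \bar{U}_{wgp}(\mathbf{S}, \mathbf{P}, \mathbf{R}, \hat{\mathbf{H}})$$

$$\text{s.t.} \quad \Pr[C_{m,k,n} < r_{m,k,n} \mid \hat{\mathbf{H}}] = \epsilon, \quad \forall n,\ m \in \{0, \dots, M\},\ k \in \mathcal{K}_m \tag{4}$$

$$s^B_{m,k,n},\, s^R_{m,k,n} \in \{0, 1\}, \quad \forall n,\ m \in \{0, \dots, M\},\ k \in \mathcal{K}_m \tag{5}$$

$$\sum_{m=1}^{M} \sum_{k \in \mathcal{K}_m} s^B_{m,k,n} + \sum_{k=1}^{K_0} s^B_{0,k,n} = 1, \quad \forall n \tag{6}$$

$$\sum_{k \in \mathcal{K}_m} s^R_{m,k,n} = 1, \quad \forall n,\ m \in \{1, \dots, M\} \tag{7}$$

$$p_{m,k,n} \ge 0, \quad \forall n,\ m \in \{0, \dots, M\},\ k \in \mathcal{K}_m \tag{8}$$

$$\sum_{n=1}^{N} \sum_{k \in \mathcal{K}_m} p_{m,k,n} \le P_m, \quad \forall m \in \{0, \dots, M\} \ \text{(for DL)} \tag{9a}$$

$$\sum_{n=1}^{N} p_{m,k,n} \le P_{m,k}, \quad \forall m \in \{0, \dots, M\},\ k \in \mathcal{K}_m \ \text{(for UL)} \tag{9b}$$

¹¹Proportional fairness (PF) is a special case of the weighted sum goodput
system utility.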



IV. Dual Problem and Distributive Solution 

Note that Problem 1 is a mixed integer optimization problem and
hence is not convex. The problem is NP-hard, and brute-force
optimization has complexity exponential in the number of subbands. In
this section, we shall apply the continuous relaxation technique
[6], [3], [13], [14] to obtain an asymptotically optimal solution
and discuss the distributive implementation.

A. Continuous Relaxation of the Integer Programming Problem

To perform continuous relaxation, we allow time sharing
between users for each subband by relaxing the subband allocation
indicator to a rational value between 0 and 1, which describes
the fraction of time a particular subband is occupied by
a particular user. Mathematically, to apply the continuous
relaxation, the constraint (5) is replaced by

$$s^B_{m,k,n},\, s^R_{m,k,n} \ge 0, \quad \forall n,\ m \in \{0, \dots, M\},\ k \in \mathcal{K}_m \tag{10}$$

After equivalent transformation and continuous relaxation,
Problem 1 is approximated by the following convex
optimization problem.

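Problem 2 (Relaxed Optimization Problem):

$$\max_{\mathbf{s}, \mathbf{p}, \mathbf{t}} \ \sum_{m=1}^{M} \sum_{k \in \mathcal{K}_m} \alpha_k^m t_k^m + \sum_{k=1}^{K_0} \alpha_k^0 \sum_{n=1}^{N} s^B_{0,k,n}\, \hat{c}^B_{0,k,n}$$

$$\text{s.t.} \quad t_k^m \le \sum_{n=1}^{N} s^B_{m,k,n}\, \hat{c}^B_{m,k,n}, \quad m \in \{1, \dots, M\},\ k \in \mathcal{K}_m \tag{11}$$

$$t_k^m \le \sum_{n=1}^{N} s^R_{m,k,n}\, \hat{c}^R_{m,k,n}, \quad m \in \{1, \dots, M\},\ k \in \mathcal{K}_m \tag{12}$$

as well as the constraints in (10), (6), (7), (8), (9a) for DL
or (9b) for UL,
where¹²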

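$$\hat{c}^B_{m,k,n} = \frac{1}{2} \log_2\Big(1 + \frac{p_{0,k',n}\, g_{0,k',n}}{s^B_{m,k,n}}\Big), \quad m \in \{0, \dots, M\}$$

($m = 0$: $k' = k \in \{1, \dots, K_0\}$; $m > 0$: $k' = K_0 + m$, $k \in \mathcal{K}_m$)

$$\hat{c}^R_{m,k,n} = \frac{1}{4} \log_2\Big(1 + \frac{p_{m,k,n}\, g_{m,k,n}}{s^R_{m,k,n}}\Big), \quad m \in \{1, \dots, M\},\ k \in \mathcal{K}_m \tag{13}$$
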
Note that Problem 2 is a convex problem. For the objective
function, the first part $\sum_{m=1}^{M} \sum_{k \in \mathcal{K}_m} \alpha_k^m t_k^m$ is linear in $t_k^m$,
and the second part $\sum_{k=1}^{K_0} \alpha_k^0 \sum_{n=1}^{N} s^B_{0,k,n}\, \hat{c}^B_{0,k,n}$ is a positive
linear combination of functions of the type $f(x, y) = x \log(1 + y/x)$,
$y \ge 0$, $x > 0$. According to Lemma 1 of [14], the second
part is concave in $(s^B_{0,k,n}, p_{0,k,n})$. Therefore, the objective
function is concave. Similarly, after moving the R.H.S. of

¹² ... random variable with 2 degrees of freedom and non-centrality parameter
$|\hat{H}_{m,k,n}|^2 / \sigma_e^2$. (13) can be derived from the conditional PER constraint
(4), the expression of instantaneous mutual information (2) and the non-central
chi-square distribution of $|H_{m,k,n}|^2$ given $\hat{H}_{m,k,n}$, which is omitted due to
page limit.
constraints (11) and (12) to the L.H.S., it can be proved that
the inequality constraint functions are convex. In addition, the
remaining equality constraint functions are linear. Hence, Problem 2
is a convex optimization problem.
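
The concavity of $f(x, y) = x \log(1 + y/x)$ used in the argument above can be spot-checked numerically. The sketch below is not a proof; it only tests the midpoint inequality $f\big(\tfrac{a+b}{2}\big) \ge \tfrac{f(a) + f(b)}{2}$ on random point pairs. The sampling ranges and the function names are arbitrary choices made for this illustration.

```python
import math
import random


def perspective_log(x, y):
    """f(x, y) = x * log(1 + y / x) for x > 0 and y >= 0."""
    return x * math.log(1.0 + y / x)


def midpoint_concavity_violations(trials, rng):
    """Count random point pairs that violate midpoint concavity."""
    violations = 0
    for _ in range(trials):
        x1, x2 = rng.uniform(0.01, 1.0), rng.uniform(0.01, 1.0)
        y1, y2 = rng.uniform(0.0, 10.0), rng.uniform(0.0, 10.0)
        at_midpoint = perspective_log((x1 + x2) / 2.0, (y1 + y2) / 2.0)
        average = (perspective_log(x1, y1) + perspective_log(x2, y2)) / 2.0
        if at_midpoint < average - 1e-12:  # small tolerance for rounding
            violations += 1
    return violations


print(midpoint_concavity_violations(10000, random.Random(0)))
```

Here x plays the role of the time-sharing fraction of a subband and y the power spent on it, which is why the relaxed objective remains concave in both jointly.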

The continuous relaxation in Problem 2 does not necessarily
yield a solution where all the subband allocation indicators
are 0 or 1. However, they are 0 or 1 with high probability
due to the property of marginal benefit of extra bandwidth
defined in Appendix B. If an integer solution is required, we
can round the fractional values to 0 or 1. In fact, under a
mild condition¹³ (the number of subcarriers per subband tends to
infinity, see Remark 1), the solution of Problem 2 will be
identical to that of Problem 1 when we do subcarrier allocation
in both problems.
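
One simple way to carry out the rounding mentioned above is to give each subband to the link holding the largest fractional share. The text does not prescribe a specific rounding rule, so the winner-takes-all rule, the function name and the `shares[n][k]` layout below are assumptions for illustration.

```python
def round_subband_shares(shares):
    """Round fractional time-sharing factors to a 0/1 allocation.

    shares[n][k] is the fractional share of subband n given to link k.
    Each subband goes entirely to the link with the largest share
    (ties are resolved in favor of the lowest index).
    """
    rounded = []
    for row in shares:
        winner = max(range(len(row)), key=lambda k: row[k])
        rounded.append([1 if k == winner else 0 for k in range(len(row))])
    return rounded


# Two subbands shared by two links (hypothetical relaxed solution).
print(round_subband_shares([[0.9, 0.1], [0.3, 0.7]]))
```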

Remark 1: Consider the number of subcarriers in each
independent subband. We assume the channel gains of all the
subcarriers in one subband are highly correlated. Suppose a
user is assigned $n$ subcarriers in a particular
subband during one transmission, where $n$ ranges from 0 to the
number of subcarriers in the subband. We can interpret the ratio
of $n$ to the number of subcarriers in the subband as the
frequency sharing factor in this subband, which takes discrete
values from 0 to 1. As the number of subcarriers per subband tends
to infinity, the frequency sharing factor can take any rational
value between 0 and 1. This can be achieved
by further subcarrier allocation within the subband.



B. Partial Dual Decomposition and Distributed Solution 

The subband, power and bit rate allocation for the
relay-assisted OFDMA system is a complicated optimization
problem. Existing approaches [3], [6], [7] consider centralized
solutions, but these solutions impose a huge computational load
on the BS. Furthermore, these solutions require knowledge of
GCSI, which induces heavy signaling overhead in the system.
In this section, we shall derive a low-complexity distributive
solution from Problem 2 using decomposition techniques.

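Using convex optimization techniques (details in Appendix
A), the dual function of Problem 2 can be simplified as follows

$$g(\boldsymbol{\lambda}^1, \dots, \boldsymbol{\lambda}^M) = \max_{\mathbf{s}, \mathbf{p}} \ \sum_{k=1}^{K_0} \alpha_k^0 \sum_{n=1}^{N} s^B_{0,k,n}\, \hat{c}^B_{0,k,n} + \sum_{m=1}^{M} \sum_{k \in \mathcal{K}_m} \lambda_k^m \sum_{n=1}^{N} s^B_{m,k,n}\, \hat{c}^B_{m,k,n} + \sum_{m=1}^{M} \sum_{k \in \mathcal{K}_m} (\alpha_k^m - \lambda_k^m) \sum_{n=1}^{N} s^R_{m,k,n}\, \hat{c}^R_{m,k,n}$$

$$\text{s.t.} \quad (10),\ (6),\ (7),\ (8),\ (9a) \text{ for DL or } (9b) \text{ for UL} \tag{14}$$

In what follows, we apply the dual decomposition approach
[15], [16]. The dual function can be decomposed into two
subproblems:

Subproblem 1: (Resource Allocations at BS in Phase One
of Downlink (Phase Two of Uplink)) Given $M$ vectors of
Lagrangian multipliers $\boldsymbol{\lambda}^m$, optimize the subband and power
allocations at the BS such that the weighted goodput is maximized
subject to the corresponding subband constraints at the BS, and
power constraints at the BS for downlink (at the MSs in cluster
0 and RSs for uplink).

$$\max \ \sum_{m=1}^{M} \sum_{k \in \mathcal{K}_m} \lambda_k^m \sum_{n=1}^{N} s^B_{m,k,n}\, \hat{c}^B_{m,k,n} + \sum_{k=1}^{K_0} \alpha_k^0 \sum_{n=1}^{N} s^B_{0,k,n}\, \hat{c}^B_{0,k,n}$$

$$\text{s.t.} \quad s^B_{m,k,n} \ge 0,\ (6),\ (8),\ (9a) \text{ or } (9b)\ (m = 0)$$

¹³The condition is quite mild and can be satisfied in most practical systems.
For example, WiMAX (802.16e) systems use 2048 subcarriers with $T \approx 6$.
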



Subproblem 2: (Resource Allocations at the $m$th RS in
Phase Two of Downlink (Phase One of Uplink)) Given the
vector of Lagrangian multipliers $\boldsymbol{\lambda}^m$, optimize the subband and
power allocations at the $m$th RS such that the weighted
goodput is maximized subject to the corresponding subband constraints
at the $m$th RS, and power constraints at the $m$th RS for
downlink (at the MSs in cluster $m$ for uplink).

$$\max \ \sum_{k \in \mathcal{K}_m} (\alpha_k^m - \lambda_k^m) \sum_{n=1}^{N} s^R_{m,k,n}\, \hat{c}^R_{m,k,n}$$

$$\text{s.t.} \quad s^R_{m,k,n} \ge 0,\ (7),\ p_{m,k,n} \ge 0,\ (9a) \text{ or } (9b)\ (m > 0)$$

There are $M$ subproblems of this kind, each one corresponding
to the resource allocations at one RS.

Subproblem 1 and Subproblem 2 are similar. Please refer
to Appendix B for the optimal solution. With the dual
function mentioned above, the dual problem is summarized
below:

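Problem 3 (Dual Problem): Find the optimal dual variables
which minimize the dual function $g$ in (14)

$$\min_{\boldsymbol{\lambda}^1, \dots, \boldsymbol{\lambda}^M} \ g(\boldsymbol{\lambda}^1, \dots, \boldsymbol{\lambda}^M) \quad \text{s.t.} \quad \mathbf{0} \preceq \boldsymbol{\lambda}^m \preceq \boldsymbol{\alpha}^m, \quad m \in \{1, \dots, M\}$$

where $\boldsymbol{\alpha}^m = (\alpha_k^m)_{k \in \mathcal{K}_m}$ ($m \in \{1, \dots, M\}$).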

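We use the subgradient method [17] to update each dual
variable as follows

$$\lambda_k^m(i + 1) = \Big[ \lambda_k^m(i) - \delta^m(i) \Big( \sum_{n=1}^{N} s^B_{m,k,n}\, \hat{c}^B_{m,k,n} - \sum_{n=1}^{N} s^R_{m,k,n}\, \hat{c}^R_{m,k,n} \Big) \Big]_{\mathcal{X}_k^m} \tag{15}$$

where $\delta^m(i)$ is a positive step size and $[\cdot]_{\mathcal{X}_k^m}$ denotes the
projection onto the feasible set $\mathcal{X}_k^m = \{\lambda_k^m \mid 0 \le \lambda_k^m \le \alpha_k^m\}$ (i.e.
let $\lambda_k^m(i + 1) = 0$ if $[\cdot] < 0$, $\lambda_k^m(i + 1) = \alpha_k^m$ if $[\cdot] > \alpha_k^m$, and keep
$\lambda_k^m(i + 1) = [\cdot]$ if $[\cdot] \in \mathcal{X}_k^m$). We study the convex Problem 2
by solving its dual problem with the decomposition technique and
the subgradient method. Since Problem 2 is convex and strictly
feasible¹⁴, Slater's condition holds. Therefore, the duality gap is
zero and hence, the primal variables $\mathbf{S}(\boldsymbol{\lambda}^1(i), \dots, \boldsymbol{\lambda}^M(i))$
and $\mathbf{P}(\boldsymbol{\lambda}^1(i), \dots, \boldsymbol{\lambda}^M(i))$ will converge to the primal optimal
variables $\mathbf{S}^*$, $\mathbf{P}^*$. Since the dual problem is always convex,
the convergence of the proposed scheme is guaranteed [17].
For any initial value in the feasible set, the dual variable $\lambda_k^m(i)$
will converge to the dual optimal $\lambda_k^{m*}$ as $i \to \infty$.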
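
The projected subgradient iteration can be sketched for a single relayed user as follows. The update moves the multiplier against the difference between the first-hop and second-hop rates and clips the result to $[0, \alpha]$. The two rate functions here are toy stand-ins (linear in the weight they receive) for the solutions of Subproblems 1 and 2; they, the step size and the function names are assumptions made only for this example.

```python
def projected_step(lam, alpha, rate_one, rate_two, step):
    """One projected subgradient update of a single multiplier.

    The multiplier decreases when the first hop is faster than the second
    hop, and the result is projected onto the interval [0, alpha].
    """
    raw = lam - step * (rate_one - rate_two)
    return min(max(raw, 0.0), alpha)


def run_dual_updates(alpha, first_hop_rate, second_hop_rate,
                     step=0.1, tol=1e-3, max_iter=200):
    """Iterate until the two hop rates agree within tol or max_iter is hit."""
    lam = alpha / 2.0  # hypothetical initial multiplier inside the feasible set
    for iteration in range(max_iter):
        rate_one = first_hop_rate(lam)            # weight lam in phase one
        rate_two = second_hop_rate(alpha - lam)   # weight alpha - lam in phase two
        if abs(rate_one - rate_two) < tol:
            return lam, iteration
        lam = projected_step(lam, alpha, rate_one, rate_two, step)
    return lam, max_iter


# Toy hops: the first hop is three times as efficient as the second.
lam, iterations = run_dual_updates(
    alpha=1.0,
    first_hop_rate=lambda weight: 3.0 * weight,
    second_hop_rate=lambda weight: 1.0 * weight,
)
print(lam, iterations)
```

In this toy setting the multiplier settles where both hops carry the same rate, which mirrors the role of the dual update in equalizing the scheduled data rates of the two phases.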

The distributive architecture of the derived solution is
illustrated in Fig. 2. Intuitively, for user $k$ in cluster $m$ ($m \in$

¹⁴For example, we can easily find $s^B_{m,k,n}, s^R_{m,k,n} > 0$ which satisfy (6)
and (7), and $p_{m,k,n} > 0$ which satisfy (9a) for downlink or (9b) for uplink.
By choosing $t_k^m = \min\{\sum_{n=1}^{N} s^B_{m,k,n}\, \hat{c}^B_{m,k,n},\ \sum_{n=1}^{N} s^R_{m,k,n}\, \hat{c}^R_{m,k,n}\} - \epsilon$
($\forall \epsilon > 0$), we can obtain a strictly feasible solution to the convex
Problem 2.
Fig. 2. Decomposition Structure (Subproblem 2: second-hop allocation for far users in cluster $M$ at RS $M$)



$\{1, \dots, M\}$), the dual variable $\lambda_k^m(i)$ can be interpreted as
the equivalent weight in phase one for downlink (phase two
for uplink), while $\alpha_k^m - \lambda_k^m$ is the equivalent weight in phase
two for downlink (phase one for uplink). Based on the weights
assigned by the master problem, Subproblem 1 can be solved
at the BS with its local imperfect CSIT ($\hat{\mathbf{H}}_0$), and Subproblem 2
for the $m$th RS can be solved at the $m$th RS with its local
imperfect CSIT ($\hat{\mathbf{H}}_m$). Then the dual problem updates the dual
variables at the BS to reduce the difference between the scheduled
data rate for a particular user in phase one and phase two at
each iteration.

The distributive implementation offloads substantial computation
from the BS to the $M$ RSs. In addition, it saves the large signaling
overhead of collecting the CSI of RS-MS links and broadcasting
the scheduled results to the RSs for resource allocation of
RS-MS links. These advantages are highly desirable for practical
implementation.

V. Reduced Feedback Distributive Solution for 
Downlink Systems 

Compared with the centralized solution, the complexity and
signaling overhead in the above distributive solution are greatly
reduced. However, in downlink systems, the overhead due to
the feedback of CSI from MSs to their cluster controllers
(the BS and RSs) and the signaling between the BS and RSs
still grows linearly with the number of users¹⁵. Selective
multiuser diversity (SMUD) has been proposed in [18], [19]
for cellular systems to reduce feedback overhead by imposing a
local threshold-based screening. We shall extend the
threshold-based mechanism to reduce the overall system feedback
overhead of the distributive solution for the relay-assisted cellular
network with fairness consideration. The reduced feedback
distributive solution is outlined below¹⁶.

¹⁵In uplink systems, since resource allocation is performed at the BS and
RSs, which are the receivers, we do not have the issue of CSI feedback
overhead. After performing distributive resource allocation, the BS and active
RSs inform the corresponding scheduled MSs of the scheduled subband
and transmit SNR for transmission.

¹⁶In practice (e.g. HSDPA, WiMAX, 802.16m, LTE), there is always a
pre-allocated dedicated UL signaling channel associated with high speed downlink
packet data access and DL resource allocation. So our feedback information
from MS->RS and RS->BS can be transmitted over these available dedicated
control channels.



Algorithm 1 (Distributive Algorithm with Reduced Feedback): 

1) The controller of cluster $m$ ($m \in \{0, \dots, M\}$) finds the MS $i$
with the maximum weight $\alpha_i^m$ among the MSs in cluster $m$,
and broadcasts the feedback threshold $\tilde{\gamma}^m = \alpha_i^m l_i^m \gamma^m$
to the MSs in its cluster.

2) Each MS in cluster $m$ compares and feeds back its CSI
in the $n$th subband iff $\alpha_k^m l_k^m |\hat{H}_{m,k,n}|^2 \ge \tilde{\gamma}^m$. Then
the local reduced user set¹⁷ and the corresponding imperfect
CSIT are available at each cluster manager.

3) BS decides and broadcasts the initial multipliers
$\{\boldsymbol{\lambda}^m(0) \mid m \in \{1, \dots, M\}\}$ in phase one. The initial
multipliers $\{\boldsymbol{\alpha}^m - \boldsymbol{\lambda}^m(0) \mid m \in \{1, \dots, M\}\}$ in phase two are
available at the RSs for Subproblem 2.

4) In the $i$th iteration, BS solves Subproblem 1 and each
RS solves its own Subproblem 2. Each RS reports the
scheduled data rate of users in its cluster to BS.

5) BS updates the multipliers to $\{\boldsymbol{\lambda}^m(i + 1)\}$ in
phase one according to (15), and broadcasts
$\{\boldsymbol{\lambda}^m(i + 1)\}$.

6) If the difference of the scheduled data rate in two phases 
for each user in RS clusters is less than a threshold, or 
the number of iterations has already reached a predeter- 
mined value, then terminate the algorithm. Otherwise, 
jump to step 4). 
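
The screening in steps 1) and 2) can be sketched as a filter that keeps only the users whose screening metric reaches the broadcast threshold on at least one subband. In the sketch below, `metrics[k][n]` stands for whatever quantity each MS compares against the threshold on subband n; this layout, the function name and the numbers are hypothetical.

```python
def reduced_user_set(metrics, threshold):
    """Build the reduced user set of one cluster.

    metrics[k][n]: screening metric of MS k on subband n.
    Returns a dict mapping each MS that feeds back at least once to the
    list of subbands it reports; MSs that never reach the threshold stay
    silent and are left out.
    """
    reduced = {}
    for k, row in enumerate(metrics):
        fed_back = [n for n, value in enumerate(row) if value >= threshold]
        if fed_back:
            reduced[k] = fed_back
    return reduced


# Three MSs, two subbands: MS 1 never reaches the threshold.
metrics = [[0.2, 1.5],
           [0.1, 0.3],
           [2.0, 2.5]]
print(reduced_user_set(metrics, threshold=1.0))
```

Only the users in the returned set take part in the iterations of steps 4) to 6), which is what keeps the signaling between the BS and the RSs proportional to the size of the reduced sets.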

In the above algorithm, the communication overhead
between the BS and each RS grows linearly only with the total
number of users in the $M + 1$ reduced user sets. Hence, the
feedback load of this algorithm is much smaller than that of the direct
distributive implementation with full feedback. Although this
algorithm is in general suboptimal due to the existence of
feedback outage, the following lemma shows
that under some conditions, its performance converges to
that of the direct distributive implementation with all MSs
transmitting feedback. Due to the symmetric situation in each
subband, we only consider one subband and drop the index
$n$ for simplicity.

Lemma 1: (Feedback Outage, Feedback Load and Asymptotic
Performance) Assume that the weight of each user is not
smaller than 1. Without loss of generality, assume user $i$ has
the maximum weight a,™ in Cluster m. Define TJT = ^^^^ . 

1) Given the threshold 7"^ = a'l^l™'^™-, the outage prob- 
ability Pq" (the probability that no one feedbacks) and 
the feedback load F™ (average number of feedbacks per 
user) are given by = flf - exp(-T™7'"))and 
P"" = Tt: Sf=i(l - exp(-rfe™7™)) respectively. 

2) Let the upper bound of be P^{K^). If P^{K^) 



satisfies Po"(-^™) 
1 as 



as 
1 



^ and P^{Km)T^ 
00, choose 7'"(Po™(A:„,)) 
so that Pq" and 



max 

as Km — > 00. 
¹⁷The reduced user set is made up of users who feed back in at least one
subband. The channel coefficient of any user in this set will be treated as
0 in the subbands without feedback from this user.

There is a tradeoff between feedback outage probability and
feedback load by adjusting $\gamma^m$. However, for sufficiently large
$K_m$, Algorithm 1 can achieve asymptotically optimal weighted
average goodput at asymptotically zero feedback cost per user
by choosing

$$\gamma^m\big(\bar{P}_o^m(K_m)\big) = \log \frac{1}{1 - \big(\bar{P}_o^m(K_m)\big)^{1/K_m}}$$

Proof: Please refer to Appendix C.
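
The tradeoff between feedback outage and feedback load can be illustrated in the simplest symmetric special case, which is an assumption of this sketch rather than the general setting of Lemma 1: K users whose screening metrics are i.i.d. unit-mean exponential variables (Rayleigh fading with equal weights and path losses). A user then feeds back with probability $e^{-\gamma}$, and nobody feeds back with probability $(1 - e^{-\gamma})^K$. The function names are hypothetical.

```python
import math


def outage_probability(gamma, num_users):
    """Probability that none of num_users i.i.d. Exp(1) metrics reaches gamma."""
    return (1.0 - math.exp(-gamma)) ** num_users


def feedback_load(gamma):
    """Probability that one user feeds back, i.e. average feedback per user."""
    return math.exp(-gamma)


def threshold_for_outage(target_outage, num_users):
    """Threshold that makes the feedback outage probability equal target_outage."""
    return -math.log(1.0 - target_outage ** (1.0 / num_users))


for num_users in (10, 100, 1000):
    gamma = threshold_for_outage(0.01, num_users)
    print(num_users, gamma, feedback_load(gamma))
```

For a fixed target outage probability, the threshold grows with the number of users and the per-user feedback load shrinks, which is the mechanism behind the "asymptotically zero feedback cost per user" statement.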



VI. Asymptotic Performance Analysis under PFS
for Downlink Systems

In this section, we shall focus on studying how the
system performance of the scheduling algorithm derived in the
preceding section for downlink systems grows with various
important system parameters such as the number of RSs and
the number of MSs. Specifically, we consider proportional
fair scheduling (PFS) [20] performance in the limit situation
$\to \infty$ [21], [22] for sufficiently large $K$ in the analysis. To
obtain insights on the performance gains, we impose a set of
simplifying assumptions. We assume each RS cluster contains $K$ MSs
with $K_0 = 0$. Furthermore, we assume a line-of-sight link (with
a high gain antenna) between the RSs and the BS and hence,
the throughput is limited by the second hop. Under the above
assumptions, we can substantially simplify the system model
without affecting the asymptotic performance and hence, we
can derive closed form results which would otherwise be
impossible in general regimes.

Let $D$ be the radius of a cell. Path loss takes on the form $PL(d) = d^{-\alpha}$, where $\alpha$ is the path loss exponent. In the relay-assisted system, assume there are $M$ RSs and $K$ users uniformly distributed in each RS cluster. Assume equal power allocation to each subband at the RSs. Accordingly, in the system without RS, we assume $MK$ users in the same cell-edge region as the relay clusters, which equals the total number of users in the relay-assisted system, and the BS allocates equal power to each subband. The asymptotic performance under PFS of the two systems is summarized in the following lemma.

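Lemma 2: (Asymptotic Performance for the Systems with RSs and without RSs) For the system with $M$ RSs and $K$ MSs in each RS cluster, the average asymptotic throughput for large $K$ is

$$E[T] = \frac{M}{2}\left(\log_2\left(\frac{P}{N}\ln K\right) - \alpha\left(\log_2 D + \log_2 t - \frac{1}{2\ln 2}\right)\right).$$

For the system with $MK$ MSs and no RSs, the average asymptotic throughput for large $K$ is

$$E[T^{(b)}] = \log_2\left(\frac{P}{N}\ln(MK)\right) - \alpha\left(\log_2 D + \frac{(1-2t)^2}{1-(1-2t)^2}\log_2\frac{1}{1-2t} - \frac{1}{2\ln 2}\right),$$

where $t = \frac{\sin\frac{\pi}{M}}{1+\sin\frac{\pi}{M}}$.

Proof: Please refer to Appendix D. ■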
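Define the gain of the relay-assisted design over the non-relay design as $g = E[T] - E[T^{(b)}] = g_{fsr} + g_{pl}$, where $g_{fsr} = \frac{M}{2}\left(\log_2\left(\frac{P}{N}\ln K\right) - \alpha\log_2 D\right) - \left(\log_2\left(\frac{P}{N}\ln(MK)\right) - \alpha\log_2 D\right)$ is the throughput gain from frequency and spatial reuse, and $g_{pl} = \alpha\left(\frac{(1-2t)^2}{1-(1-2t)^2}\log_2\frac{1}{1-2t} - \frac{M}{2}\log_2 t + \left(\frac{M}{2}-1\right)\frac{1}{2\ln 2}\right)$ is the throughput gain from reducing the energy loss due to path loss.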

Remark 1: Since $g_{fsr} = \Theta(M \ln\ln K) - \Theta(\ln\ln(MK))$, the throughput of the relay-assisted system grows much faster than that of the system without RS, which is due to the spatial reuse of the relay-assisted architecture. It can be shown that $g_{pl} > 0$ and that it increases with $M$. This demonstrates the benefit of the relay-assisted architecture in reducing the energy loss due to path loss.

Footnote: Since the number of MSs is large, equal power allocation is asymptotically optimal due to multi-user diversity.

Fig. 3. Average goodput allocation for 10 users in a single cell scenario at BS transmit power 36 dBm, RS transmit power 36 dBm at N = 16. (x-axis: Users. Legend: Proposed Algorithm (sum 61.247); SSA (sum 30.800); MaxPF without RS (sum 12.209); Round Robin without RS (sum 6.26…).)
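The closed-form expressions of Lemma 2 and the gain decomposition can be evaluated directly. The sketch below assumes the prefactor $M/2$ for the relay-assisted system, a per-subband SNR term `snr` standing for $P/N$ with normalized noise, and $t = \sin(\pi/M)/(1+\sin(\pi/M))$; the function names are hypothetical and the numbers fed in are for illustration only.

```python
import math

HALF_OVER_LN2 = 1.0 / (2.0 * math.log(2.0))  # the constant 1 / (2 ln 2)

def cluster_radius_ratio(M):
    """t = D_m / D for M relay discs packed tangentially inside the cell."""
    s = math.sin(math.pi / M)
    return s / (1.0 + s)

def ring_term(M):
    """(1-2t)^2 / (1-(1-2t)^2) * log2(1/(1-2t)), valid for M >= 3."""
    c = 1.0 - 2.0 * cluster_radius_ratio(M)
    return (c * c / (1.0 - c * c)) * math.log2(1.0 / c)

def throughput_with_relays(M, K, snr, D, alpha):
    t = cluster_radius_ratio(M)
    return (M / 2.0) * (math.log2(snr * math.log(K))
                        - alpha * (math.log2(D) + math.log2(t) - HALF_OVER_LN2))

def throughput_without_relays(M, K, snr, D, alpha):
    return (math.log2(snr * math.log(M * K))
            - alpha * (math.log2(D) + ring_term(M) - HALF_OVER_LN2))

def gain_reuse(M, K, snr, D, alpha):
    """g_fsr: gain from frequency and spatial reuse."""
    return ((M / 2.0) * (math.log2(snr * math.log(K)) - alpha * math.log2(D))
            - (math.log2(snr * math.log(M * K)) - alpha * math.log2(D)))

def gain_path_loss(M, alpha):
    """g_pl: gain from the shorter second-hop distance."""
    t = cluster_radius_ratio(M)
    return alpha * (ring_term(M) - (M / 2.0) * math.log2(t)
                    + (M / 2.0 - 1.0) * HALF_OVER_LN2)
```

Evaluating `gain_path_loss` for increasing `M` gives a quick numerical check of the statement that $g_{pl}$ is positive and grows with the number of RSs.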

VII. Simulation Results and Discussion 

In this part, we compare our distributive subband, power and rate allocation for the relay-assisted OFDMA system with several baseline references. Baseline 1 refers to the weighted total goodput maximization version of Separate and Sequential Allocation (SSA), which is a semi-distributed scheme proposed in [8]. Baseline 2 refers to the maximum total weighted goodput scheduling without RSs. In Baseline 3, we consider Round Robin scheduling with water-filling power allocation across the subbands. We apply the PFS algorithm to keep track of the average goodput of each user, and use its inverse as the weight for the three maximum total weighted goodput scheduling designs. We use Jain's index as the fairness measure, which ranges from 1/K (worst case) to 1 (best case). In the following simulation results, the average sum goodput is obtained from the optimal scheduled goodput of each user. The cell radius is 5000 m. 6 RSs are evenly located on the circle with radius 3000 m. We set up our simulation scenarios according to the practical settings in IEEE 802.16m systems [23]. The carrier frequency is 2 GHz. The BS/RS height is 32 m and the MS height is 1.5 m. The operating bandwidth is 10 MHz with 2048 subcarriers and 8, 16 or 24 independent subbands. The path loss model of BS-MS and RS-MS is $128.1 + 37.6\log_{10}(R)$ dB, and the path loss model of BS-RS is $128.1 + 28.8\log_{10}(R)$ dB ($R$ in km). The receive antenna gain of the MS is 0 dB, and a directional receive antenna is used at the RS with antenna gain 20 dB. The lognormal shadowing standard deviation is 8 dB, and the CSIT error variance $\sigma_e^2$ is 0.01.
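For readers reproducing the setup, the fairness measure, the PFS weights and the two path loss models can be sketched as follows. The helper names and the `link` labels are assumptions made for this illustration; only the formulas themselves come from the text.

```python
import math

def jain_index(goodputs):
    """Jain's fairness index: (sum x)^2 / (K * sum x^2), between 1/K and 1."""
    total = sum(goodputs)
    squares = sum(x * x for x in goodputs)
    if squares == 0:
        raise ValueError("at least one goodput must be non-zero")
    return total * total / (len(goodputs) * squares)

def pfs_weights(average_goodputs):
    """PFS weight of each user: the inverse of its tracked average goodput."""
    return [1.0 / g for g in average_goodputs]

def path_loss_db(distance_km, link):
    """Path loss in dB; link is 'access' (BS-MS, RS-MS) or 'backhaul' (BS-RS)."""
    slopes = {"access": 37.6, "backhaul": 28.8}
    return 128.1 + slopes[link] * math.log10(distance_km)
```

An allocation that serves a single user out of four scores 0.25, the worst case $1/K$, while an equal split scores 1.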

A. System Performance of the Distributive Algorithm 

Fig. 3 illustrates the average scheduled goodput allocation of the 10 users in a single cell scenario, the positions of which are generated according to a uniform distribution. It can be observed that our distributive scheduling algorithm achieves much higher average goodput and fairness compared with the three baselines.

Fig. 4 illustrates the convergence performance of our distributive algorithm in a single cell scenario. We plot the average best primal value curve and dual value curve within a certain number of iterations for the 10 users case (N = 24, M = 6, K = 10) and the best primal value curve for four other cases with N = 8, M = 8, K = 20, 30 respectively. It can be seen that our distributive algorithm converges quite fast. The performance at the 5th iteration is about 95% of the performance at the 50th iteration. Thus, good performance can be achieved with low overhead.

Fig. 4. Average total weighted goodput in a single cell scenario for primal problem and dual problem of 10 users (N=24, M=6, K=10) and primal problem of 4 other cases (N=8, M=8, K=20,30 respectively) versus the number of iterations at BS transmit power 36 dBm, RS transmit power 36 dBm. (Due to limited space in the graph, we have omitted the dual values of the other 4 cases.) (x-axis: Number of Iterations.)

B. System Performance versus Transmit Power 

Fig. 6 illustrates the average sum goodput performance of the 10 users in a multi-cell scenario with frequency reuse factor 3, as illustrated in Fig. 5, versus the transmit power at the BS and RS. It can be observed that our proposed scheduling design has significant gain, especially in the lower SNR region. This is mainly because the RS reduces the path loss greatly and our proposed algorithm utilizes the limited power more efficiently.

Fig. 7 illustrates the fairness performance of the 10 users in a multi-cell scenario versus the transmit power. It can be seen that the designs with RSs maintain much better fairness, especially in the lower SNR region. The main reason is that the differences in path loss among users in the cell are greatly reduced when the RS is half-way between the BS and the MS. Compared with Baseline 1, our proposed design has similar fairness performance but a much better goodput performance, as illustrated in Fig. 6.

Footnote: The convergence performance for the full feedback distributive algorithm with 30 MSs is actually the same as the convergence performance for the reduced feedback distributive algorithm with 150 MSs under only 5% goodput performance penalty.




Fig. 5. Multi-cell scenario with frequency reuse factor 3 (the number in the cell center shows the frequency band used by that cell). Two orthogonal frequency spreading codes are assigned to adjacent RSs to achieve orthogonal signal separation in the frequency domain (blue and yellow parts represent the two frequency spreading codes adopted in RS clusters). The cell radius $D$ is 5000 m. 6 RSs are evenly located on the circle with radius $D_m = 3000$ m. The distance between the BSs of the nearest co-channel cells is $R = 3D$. The distance between the RSs of the nearest relay clusters of the nearest co-channel cells is $r = 2D$.




Fig. 6. Average sum goodput of 10 users versus BS/RS transmit power in a multi-cell scenario with frequency reuse factor 3 as illustrated in Fig. 5 at N = 16. (x-axis: Transmit Power (dBm), from 16 to 46 dBm.)



C. System Performance versus the Number of Users K 

Fig. 8 illustrates the average sum goodput performance versus the number of users per RS cluster in a single cell scenario. The transmission in phase two of our design directly benefits from the increase in the number of cell-edge users. When the BS and RSs have the same transmit power of 36 dBm, the bottleneck of the performance is in phase one. Hence, the first curve does not increase much with the number of users in RS clusters. When the transmit power at the RSs is smaller than that at the BS, i.e. 31 dBm or 26 dBm, the bottleneck shifts to phase two. Thus, the overall performance increases greatly with the number of users.
Fig. 7. Jain's fairness index of 10 users versus BS/RS transmit power in a multi-cell scenario with frequency reuse factor 3 as illustrated in Fig. 5 at N = 16.

Fig. 8. Average sum goodput versus the number of users per RS cluster (6 RSs) in a single cell scenario at BS transmit power 36 dBm, RS transmit power 36 dBm, 31 dBm, 26 dBm, and N = 8. (x-axis: Number of Users per RS Cluster. Legend: Proposed Algorithm (RS 36 dBm, 31 dBm, 26 dBm); SSA (RS 36 dBm, 31 dBm, 26 dBm); MaxPF without RS; Round Robin without RS.)

Fig. 9. Average sum goodput of 60 users per cell in a single cell scenario versus the average feedback load at BS transmit power 36 dBm, RS transmit power 36 dBm, 31 dBm, and N = 8.



D. Tradeoff of Performance and Average Feedback Load in 
the Distributive Reduced Feedback Algorithm 

Fig. 9 illustrates the average sum goodput performance for 60 users per cell in a single cell scenario versus the average feedback load when implementing the distributive reduced feedback algorithm. We use different feedback thresholds to generate different average sum goodput performance with different average feedback load. Our distributive reduced feedback algorithm can achieve good performance (e.g. 95% of the distributive full feedback algorithm in Fig. 8) with low feedback load (e.g. 20%) in real systems.

VIII. Summary 

In this paper, we propose a cluster-based distributive subband, power and rate allocation for a two-hop transmission in a relay-assisted OFDMA cellular system. We take into account potential packet errors due to imperfect CSIT as well as system fairness by considering weighted sum goodput as our optimization objective. Based on the cluster-based architecture, we obtain a fast-converging distributive solution with only local imperfect CSIT by using careful decomposition of the optimization problem. Our solution can be applied to both UL and DL allocations. To further reduce the signaling overhead and computational complexity in downlink systems, we propose a reduced feedback distributive algorithm, which can achieve asymptotically optimal performance for a large number of users with arbitrarily small feedback outage and feedback load. We also derive the asymptotic average system goodput for the relay-assisted OFDMA system so as to obtain useful design insights.

Appendix A: Derivation of Dual Function

By relaxing the global coupling constraints, we have the following dual function

M Ko N 

' ' m=l feeC, fe=l n=l 



M 



N 



E E A^E^ 



B ~B 
m.k.n m.k.n 



m=l keK„ 
M 



n=l 
N 



+ E E -r(E<M' 

m— 1 k^K-m 71 — 1 

s.t.Hni, I©, O, (HI), (Hifor DLor ©for UL 

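where $\lambda^m = (\lambda_k^m)_{K_m \times 1} \succeq 0$ and $\nu^m = (\nu_k^m)_{K_m \times 1} \succeq 0$ ($m \in \{1, \ldots, M\}$) are the vectors of Lagrangian multipliers. Firstly, we optimize over $t$. The part of the dual function with respect to $t$ is given by

$$g_0 = \max_{t} \sum_{m=1}^{M} \sum_{k \in \mathcal{K}_m} \left(a_k^m - \lambda_k^m - \nu_k^m\right) t_k^m = \begin{cases} 0, & \lambda_k^m + \nu_k^m = a_k^m \ \ \forall m, k \\ \infty, & \text{otherwise.} \end{cases}$$

To make sure that the dual function is bounded above, we have $\lambda_k^m + \nu_k^m = a_k^m$. Hence, we can simplify the dual function.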
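Appendix B: Optimal Solution to the Relaxed Problem

For downlink systems, subproblems 1 and 2 share a similar form as follows:

$$\max_{\{s_{k,n}\},\{p_{k,n}\}} \ \sum_{k=1}^{K} \sum_{n=1}^{N} a_k s_{k,n} \log_2\left(1 + \frac{g_{k,n} p_{k,n}}{s_{k,n}}\right)$$

$$\text{s.t. } s_{k,n} \ge 0, \quad \sum_{k=1}^{K} s_{k,n} \le 1 \ \ \forall n, \quad p_{k,n} \ge 0, \quad \sum_{n=1}^{N} \sum_{k=1}^{K} p_{k,n} \le P.$$

The Lagrangian function is given by

$$L = \sum_{k=1}^{K} \sum_{n=1}^{N} a_k s_{k,n} \log_2\left(1 + \frac{g_{k,n} p_{k,n}}{s_{k,n}}\right) + \mu\left(P - \sum_{n=1}^{N} \sum_{k=1}^{K} p_{k,n}\right) + \sum_{n=1}^{N} \nu_n\left(1 - \sum_{k=1}^{K} s_{k,n}\right).$$

The optimality conditions are

$$\frac{\partial L}{\partial p_{k,n}} = 0 \Rightarrow p_{k,n} = s_{k,n}\left(\frac{a_k}{\mu} - \frac{1}{g_{k,n}}\right)^+ \qquad (16)$$

$$\frac{\partial L}{\partial s_{k,n}} = 0 \Rightarrow X_{k,n} = a_k \log_2\left(1 + \frac{g_{k,n} p_{k,n}}{s_{k,n}}\right) - a_k \frac{g_{k,n} p_{k,n}}{s_{k,n} + g_{k,n} p_{k,n}} \qquad (17)$$

where $X_{k,n}$ can be interpreted as the marginal benefit of extra bandwidth [14]. By substituting (16) into (17), we have

$$X_{k,n} = a_k \log_2\left(1 + g_{k,n}\left(\frac{a_k}{\mu} - \frac{1}{g_{k,n}}\right)^+\right) - a_k \frac{g_{k,n}\left(\frac{a_k}{\mu} - \frac{1}{g_{k,n}}\right)^+}{1 + g_{k,n}\left(\frac{a_k}{\mu} - \frac{1}{g_{k,n}}\right)^+}.$$

$\mu$ satisfies

$$\sum_{k=1}^{K} \sum_{n=1}^{N} s_{k,n}\left(\frac{a_k}{\mu} - \frac{1}{g_{k,n}}\right)^+ = P. \qquad (18)$$

$\mu$ can be obtained by the subgradient method. For a particular $\mu$, if there is a unique $k^* = \arg\max_k \{X_{k,n}\}$ for some $n$, time-sharing will not happen in this subband:

$$s_{k,n} = \begin{cases} 1, & X_{k,n} = \max_k \{X_{k,n}\} > 0 \\ 0, & \text{otherwise.} \end{cases}$$

For each given $\mu$, $X_{k,n}$ is a function of the CSI $g_{k,n}$ of each user, and these are independent random variables. As a result, there is probability 0 that $X_{k,n} = X_{k',n}$ with $k \neq k'$. Hence, with probability 1 the solution of the relaxed problem corresponds to that of the original problem.

For uplink systems, we use the individual power constraint $\sum_{n=1}^{N} p_{k,n} \le P_k$ instead. Similarly, we have $p_{k,n} = s_{k,n}\left(\frac{a_k}{\mu_k} - \frac{1}{g_{k,n}}\right)^+$, where $\mu_k$ satisfies $\sum_{n=1}^{N} s_{k,n}\left(\frac{a_k}{\mu_k} - \frac{1}{g_{k,n}}\right)^+ = P_k$, and $s_{k,n}$ can be obtained with the same method as the downlink.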
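The downlink solution structure can be illustrated with a short sketch: for a trial $\mu$, each subband goes to the user with the largest positive marginal benefit, that user receives water-filling power, and $\mu$ is adjusted until the total power constraint is met. Two assumptions are made here for simplicity: the multiplier is found by bisection rather than by the subgradient method, and rates are measured in nats (natural logarithm), as in the form used in Appendix C. The function name `allocate` is hypothetical.

```python
import math

def allocate(weights, gains, total_power, iters=100):
    """weights[k] = a_k, gains[k][n] = g_{k,n} > 0.

    Returns (mu, owner, power): owner[n] is the user holding subband n
    (or None) and power[n] is the power spent on subband n.
    """
    K, N = len(gains), len(gains[0])

    def solve_for(mu):
        owner, power = [], []
        for n in range(N):
            best_k, best_x = None, 0.0
            for k in range(K):
                x = weights[k] * gains[k][n]
                if x > mu:  # user k would receive positive power here
                    benefit = weights[k] * (math.log(x / mu) - (1.0 - mu / x))
                    if benefit > best_x:
                        best_k, best_x = k, benefit
            owner.append(best_k)
            power.append(0.0 if best_k is None
                         else weights[best_k] / mu - 1.0 / gains[best_k][n])
        return owner, power

    # Total power is non-increasing in mu; hi always stays on the feasible side.
    lo = 1e-12
    hi = max(weights[k] * gains[k][n] for k in range(K) for n in range(N))
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        _, power = solve_for(mid)
        if sum(power) > total_power:
            lo = mid
        else:
            hi = mid
    owner, power = solve_for(hi)
    return hi, owner, power
```

For a single user with unit weight, gains 1.0 and 0.5 and total power 2, this reduces to classic water-filling with water level $1/\mu = 2.5$, giving powers 1.5 and 0.5.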



Appendix C: Proof of Lemma 1 

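For notational convenience, we only consider MSs in one cluster and omit the cluster index $m$ in the following proof. Let $g_k$ ($= l_k |h_k|^2$) be the channel gain of user $k$. Define $x_k = a_k g_k$. Given $a_i \ge a_j \ge 1\ \forall j \neq i$, we first show that if $\exists j$ such that $x_i > x_j$, then the subband will not be allocated to user $j$ according to Appendix B.

$x_i > x_j, \forall j \neq i$ leads to one of three cases:

1) $\frac{1}{\mu} - \frac{1}{x_i} > \frac{1}{\mu} - \frac{1}{x_j} > 0$
2) $\frac{1}{\mu} - \frac{1}{x_i} > 0 \ge \frac{1}{\mu} - \frac{1}{x_j}$
3) $0 \ge \frac{1}{\mu} - \frac{1}{x_i} > \frac{1}{\mu} - \frac{1}{x_j}$

In cases 2 and 3, $p_j = 0$. It is obvious that user $j$ cannot transmit in this subband. We only need to consider case 1.

$\forall k$, if $\frac{1}{\mu} - \frac{1}{x_k} > 0$, $X_k(a_k, x_k) = a_k \log\frac{x_k}{\mu} - a_k\left(1 - \frac{\mu}{x_k}\right)$. Now, we shall show that $X_k(a_k, x_k)$ increases with $a_k$ and $x_k$ separately. Let $t_k = x_k/\mu$. Then $\frac{\partial X_k}{\partial a_k} = \log t_k - \left(1 - \frac{1}{t_k}\right) \triangleq f(t_k)$ and $\frac{\partial X_k}{\partial x_k} = a_k \frac{x_k - \mu}{x_k^2}$.

Then $\frac{1}{\mu} - \frac{1}{x_k} > 0 \Rightarrow t_k > 1 \Rightarrow f'(t_k) = \frac{1}{t_k} - \frac{1}{t_k^2} > 0 \Rightarrow f(t_k > 1) > f(t_k = 1) = 0 \Rightarrow \frac{\partial X_k}{\partial a_k} > 0$. In addition, $a_k \ge 1, x_k > \mu \Rightarrow \frac{\partial X_k}{\partial x_k} > 0$. Thus, $a_i \ge a_j, x_i > x_j > \mu \Rightarrow X_i(a_i, x_i) > X_j(a_j, x_j)$. According to Appendix B, this subband will not be allocated to user $j$.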

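Given the threshold $\gamma_{th} = a_i l_i \gamma$, user $k$ will feed back its channel quality to the cluster manager iff $a_k g_k > a_i l_i \gamma \Rightarrow \gamma_k = |h_k|^2 > T_k \gamma$, where $T_k = \frac{a_i l_i}{a_k l_k}$. The R.V. $\xi_k$ denotes the feedback action of user $k$ in one subband: $\xi_k = 1$ if user $k$ feeds back its channel quality; $\xi_k = 0$ otherwise. For the Rayleigh fading assumed in this context, $\Pr[\xi_k = 1] = \Pr[\gamma_k > T_k \gamma] = e^{-T_k \gamma}$ and $\Pr[\xi_k = 0] = \Pr[\gamma_k \le T_k \gamma] = 1 - e^{-T_k \gamma}$.

Intuitively, we can consider $\gamma_{th}$ as the overall channel gain of a potential user who shares the same weight $a_i$ as user $i$ and serves as a threshold user. The scheduling is done among the feedback user set and the threshold user. If $\gamma \le \frac{\mu}{a_i l_i}$, the scheduling is still optimal. If $\gamma > \frac{\mu}{a_i l_i}$ and $\sum_{k=1}^{K} \xi_k \ge 1$, the scheduling is optimal except when the threshold user is scheduled at last, which happens with low probability. If $\sum_{k=1}^{K} \xi_k = 0$, we declare a scheduling outage and suffer a certain performance loss.

The outage probability is $P_0 = \Pr\left[\sum_{k=1}^{K} \xi_k = 0\right] = \prod_{k=1}^{K} \left(1 - \exp(-T_k \gamma)\right)$. The average feedback load is $F = \frac{1}{K} E\left[\sum_{k=1}^{K} \xi_k\right] = \frac{1}{K} \sum_{k=1}^{K} \exp(-T_k \gamma)$. We have

$$\exp(-\max\{T_k\}\gamma) \le \exp(-T_k \gamma) \le \exp(-\min\{T_k\}\gamma) \Rightarrow P_0 \le \left(1 - \exp(-\max\{T_k\}\gamma)\right)^K, \quad F \le \exp(-\min\{T_k\}\gamma).$$

Setting the outage bound equal to $P_0^U(K)$ gives

$$\left(1 - \exp(-\max\{T_k\}\gamma)\right)^K = P_0^U(K) \Rightarrow \gamma(P_0^U(K)) = \frac{1}{\max\{T_k\}} \log \frac{1}{1 - P_0^U(K)^{1/K}},$$

$$\exp(-\min\{T_k\}\gamma) = \exp\left(-\frac{\min\{T_k\}}{\max\{T_k\}} \log \frac{1}{1 - P_0^U(K)^{1/K}}\right) = \left(1 - P_0^U(K)^{1/K}\right)^{\frac{\min\{T_k\}}{\max\{T_k\}}}.$$

Since $P_0^U(K) \to 0$ as $K \to \infty$, we have $P_0 \to 0$. Because $P_0^U(K)^{1/K} \to 1$ as $K \to \infty$, we have $F \to 0$. Hence, $a_i l_i \gamma(P_0^U(K))$ is an asymptotically optimal threshold. For example, $P_0^U(K) = \frac{1}{K}$ satisfies all the conditions.

Footnote: For subproblem 1, different $(m, k)$ pairs can be treated as different $k$ in this form. Subproblem 2 for a given $m$ is equivalent to this form by omitting index $m$.

Footnote: If $\exists k \neq k'$ such that $X_{k,n} = X_{k',n}$ is the maximum, $s_{k,n}$ can be obtained by solving a set of equations $\sum_{k=1}^{K} s_{k,n} = 1$ and $X_{k,n} = X_{k',n}$ for the sharing subbands.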



Appendix D: Proof of Lemma 2 



By applying a similar approach as in [21], the throughput (bit/s/Hz) for MS $k \in \mathcal{K}_m$ in the relay-assisted system and in the system without RS are:

$$T_{m,k} = \frac{1}{2K} \int \log_2\left(1 + \frac{P}{N} l_{m,k}\, x\right) dF_{\max,K}(x) \approx \frac{1}{2K} \log_2\left(\frac{P}{N} l_{m,k} \ln K\right) \quad \text{for large } K,$$

$$T^{(b)}_{m,k} = \frac{1}{MK} \int \log_2\left(1 + \frac{P}{N} l^{(b)}_{m,k}\, x\right) dF_{\max,MK}(x) \approx \frac{1}{MK} \log_2\left(\frac{P}{N} l^{(b)}_{m,k} \ln(MK)\right) \quad \text{for large } K,$$

where $F_{\max,K}(x)$ and $F_{\max,MK}(x)$ are the cdfs of $\max\{|H_{m,k,n}|^2, \forall k\}$ and $\max\{|H_{m,k,n}|^2, \forall m, k\}$ separately, and $l_{m,k}$ and $l^{(b)}_{m,k}$ are the path loss from RS $m$ and from the BS to MS $k \in \mathcal{K}_m$ separately.

Next, we consider the average throughput over distance. To get a closed-form solution, we use a closed disc with radius $D_m$ to approximate the relay clusters. The relation between $D$ and $D_m$ is given by $(D - D_m) \sin\frac{\pi}{M} = D_m \Rightarrow D_m = \frac{\sin\frac{\pi}{M}}{1 + \sin\frac{\pi}{M}} D = tD$. Assume MSs are uniformly distributed in a relay cluster and in the system without RS. Therefore, we get the average total throughput in the relay-assisted system and in the system without RS for large $K$ in Lemma 2. In addition, it can be easily shown that $\frac{\partial g_{pl}}{\partial M} > 0$.
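Two ingredients of this proof lend themselves to a quick numerical sanity check: the large-$K$ approximation $E[\log_2(1 + \rho \max_k |H_k|^2)] \approx \log_2(\rho \ln K)$ and the average of $\log_2 d$ over a disc of radius $R$, which equals $\log_2 R - \frac{1}{2\ln 2}$ for uniformly placed users. The sketch below assumes unit-mean exponential fading power and uses hypothetical function names; it is a Monte Carlo illustration, not part of the derivation.

```python
import math
import random

def mean_rate_best_user(K, snr, trials=2000, seed=0):
    """Monte Carlo estimate of E[log2(1 + snr * max of K Exp(1) samples)]."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        best = max(rng.expovariate(1.0) for _ in range(K))
        total += math.log2(1.0 + snr * best)
    return total / trials

def asymptotic_rate(K, snr):
    """Large-K approximation log2(snr * ln K) used in the proof."""
    return math.log2(snr * math.log(K))

def mean_log2_distance_disc(R, samples=20000, seed=0):
    """Monte Carlo estimate of E[log2 d] for points uniform in a disc of radius R."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(samples):
        u = 1.0 - rng.random()            # uniform on (0, 1]
        total += math.log2(R * math.sqrt(u))
    return total / samples

def closed_form_log2_distance_disc(R):
    """Closed form log2(R) - 1 / (2 ln 2)."""
    return math.log2(R) - 1.0 / (2.0 * math.log(2.0))
```

The first approximation is only asymptotically tight, so a small gap between the simulated and approximate rates is expected at moderate $K$.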